- A recent study by De et al. (2022) reported that large-scale representation learning through pre-training on a public dataset significantly enhances differentially private (DP) learning in downstream tasks, despite the high dimensionality of the feature space. To explain this phenomenon theoretically, we consider the layer-peeled model of representation learning, which gives rise to a phenomenon in the learned features of deep learning and transfer learning known as Neural Collapse (NC). Within the NC framework, we establish an error bound showing that the misclassification error is independent of dimension when the distance between the actual features and the ideal ones is below a threshold. We also empirically evaluate the quality of the last-layer features under different pre-trained models within the NC framework, showing that a more powerful transformer yields a better feature representation. Furthermore, we show that DP fine-tuning is less robust than fine-tuning without DP, particularly in the presence of perturbations. These observations are supported by both theoretical analysis and experimental evaluation. To enhance the robustness of DP fine-tuning, we suggest several strategies, such as feature normalization or dimension-reduction methods like Principal Component Analysis (PCA). Empirically, applying PCA to the last-layer features yields a significant improvement in test accuracy.
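The PCA step on last-layer features mentioned above can be sketched as follows. This is a minimal SVD-based illustration; the function name `pca_features`, the feature shapes, and the choice of component count are all hypothetical here, since the abstract does not specify the paper's exact pipeline.

```python
import numpy as np

def pca_features(features, k):
    """Project last-layer features onto their top-k principal components.

    features: (n_samples, d) array of last-layer activations.
    Returns the (n_samples, k) reduced representation.
    """
    # Center the features before computing principal directions.
    mu = features.mean(axis=0)
    X = features - mu
    # SVD of the centered matrix; rows of Vt are the principal directions,
    # ordered by decreasing singular value (i.e., decreasing variance).
    _, _, Vt = np.linalg.svd(X, full_matrices=False)
    return X @ Vt[:k].T

# Toy usage: reduce 64-dimensional "features" to 8 components.
rng = np.random.default_rng(0)
feats = rng.normal(size=(100, 64))
reduced = pca_features(feats, 8)
print(reduced.shape)  # (100, 8)
```

In the DP fine-tuning setting, reducing dimension this way shrinks the space in which privacy noise must be added, which is one plausible reading of why the abstract reports an accuracy gain.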
- Kernel-based learning algorithms have been extensively studied over the past two decades for their successful applications in scientific research and industrial problem-solving. In classical kernel methods, such as kernel ridge regression and support vector machines, an unregularized offset term naturally appears. While its importance can be defended in some situations, it is arguable in others. It is commonly agreed, however, that the offset term introduces essential challenges to the optimization and theoretical analysis of these algorithms. In this paper, we demonstrate that Kernel Ridge Regression (KRR) with an offset is closely connected to regularization schemes involving centered reproducing kernels. With the aid of this connection and the theory of centered reproducing kernels, we establish generalization error bounds for KRR with an offset. These bounds indicate that the algorithm can achieve minimax optimal rates.
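The connection described above can be checked numerically. The sketch below solves KRR with an unregularized offset directly from its first-order conditions and, separately, via the centered kernel HKH (H = I - 11ᵀ/n); the two in-sample fits coincide. All function names, the RBF kernel choice, and the toy data are assumptions for illustration, not the paper's construction.

```python
import numpy as np

def rbf_kernel(X, Y, gamma=1.0):
    """Gaussian RBF kernel matrix k(x, y) = exp(-gamma * ||x - y||^2)."""
    d2 = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

def krr_offset(K, y, lam):
    """KRR with an unregularized offset b.

    Solves min_{a,b} ||K a + b 1 - y||^2 + lam * a^T K a via the
    first-order conditions, which reduce to the linear system
        (K + lam I) a + b 1 = y,    1^T K a + n b = 1^T y.
    Returns the coefficient vector a and the offset b.
    """
    n = len(y)
    M = np.zeros((n + 1, n + 1))
    M[:n, :n] = K + lam * np.eye(n)
    M[:n, n] = 1.0
    M[n, :n] = K.sum(axis=0)
    M[n, n] = n
    sol = np.linalg.solve(M, np.concatenate([y, [y.sum()]]))
    return sol[:n], sol[n]

def krr_centered_fit(K, y, lam):
    """In-sample fit from the centered-kernel scheme: solve
    (HKH + lam I) alpha = H y, then add back the mean of y."""
    n = len(y)
    H = np.eye(n) - np.ones((n, n)) / n
    Kc = H @ K @ H
    alpha = np.linalg.solve(Kc + lam * np.eye(n), H @ y)
    return Kc @ alpha + y.mean()

# Toy check that the two formulations give the same in-sample predictions.
rng = np.random.default_rng(1)
X = rng.normal(size=(30, 2))
y = np.sin(X[:, 0]) + 0.1 * rng.normal(size=30)
K = rbf_kernel(X, X)
a, b = krr_offset(K, y, lam=0.5)
print(np.allclose(K @ a + b, krr_centered_fit(K, y, lam=0.5), atol=1e-6))
```

The equivalence follows because eliminating b from the first-order conditions leaves (HK + lam I) a = H y, whose solution is H alpha for the centered-kernel alpha.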
 An official website of the United States government
Full Text Available